# Synthetic data training
Phi 4 Mini Reasoning GGUF
MIT
Phi-4-mini-reasoning is a lightweight open model built on synthetic data, focusing on high-quality, reasoning-rich data, and further fine-tuned for more advanced mathematical reasoning capabilities.
Large Language Model
Transformers

P
Mungert
3,592
3
Smartshot Zeroshot Finetuned V0.1.2
MIT
A zero-shot classification model fine-tuned based on roberta-base-zeroshot-v2.0-c, enhanced with SmartShot method and synthetic data
Text Classification Other
S
gincioks
119
0
Smolvlm 500M Anime Caption V0.1
Apache-2.0
A vision-language model specialized in describing anime-style images, fine-tuned from SmolVLM-500M-Base, trained on 180K synthetic image/caption pairs generated by large language models.
Image-to-Text English
S
Andres77872
61
0
Gliner Biomed Base V1.0
Apache-2.0
GLiNER-Biomedical Edition is a specialized biomedical named entity recognition model developed based on the GLiNER framework, capable of identifying multiple biomedical entity types.
Sequence Labeling
PyTorch English
G
Ihor
61
2
Gec Spanish BARTO SYNTHETIC
A Spanish grammar correction model based on the BART architecture, trained on the COWS-L2H dataset and 80,984 synthetic data entries, optimized for single-sentence correction
Text Generation
Transformers Supports Multiple Languages

G
SkitCon
118
1
EVA Qwen2.5 72B V0.2
Other
A large language model fine-tuned based on Qwen2.5-72B, specializing in text generation and instruction-following tasks
Large Language Model
Transformers

E
EVA-UNIT-01
392
19
Depth Anything V2 Metric Outdoor Large Hf
Apache-2.0
A fine-tuned version of Depth Anything V2 for outdoor metric depth estimation tasks, trained on the synthetic dataset Virtual KITTI
3D Vision
Transformers

D
depth-anything
3,662
6
Gliclass Large V1.0
Apache-2.0
An efficient zero-shot classifier trained on synthetic data, suitable for topic classification, sentiment analysis, and reranking tasks in RAG workflows.
Text Classification
Transformers English

G
knowledgator
80
5
Gliclass Base V1.0
Apache-2.0
GLiClass is an efficient zero-shot classifier inspired by GLiNER, suitable for text classification, sentiment analysis, and reranking tasks in RAG workflows.
Text Classification
Transformers English

G
knowledgator
152
3
Gliclass Base V1.0 Lw
Apache-2.0
GLiClass is an efficient zero-shot classifier trained on synthetic data, suitable for text classification, sentiment analysis, and reranking tasks in RAG workflows.
Text Classification
Transformers English

G
knowledgator
57
2
Llama 3 Instruct 8B SPPO Iter3
Apache-2.0
A large language model developed in the third iteration using the Self-Play Preference Optimization method based on the Meta-Llama-3-8B-Instruct architecture.
Large Language Model
Transformers English

L
UCLA-AGI
8,539
83
Gemma
Gemma is an advanced open-source model trained on high-quality datasets, supporting different context length requirements.
Large Language Model
G
cortexso
295
1
Gliclass Large V1.0 Init
Apache-2.0
GLiClass is an efficient zero-shot classifier trained on synthetic data, suitable for topic classification, sentiment analysis, and reranking tasks in RAG workflows.
Text Classification
Transformers English

G
knowledgator
85
13
T5 Base Spell Correction Fr
MIT
This model is based on the T5 architecture, specifically designed to correct spelling and punctuation errors in French text.
Text Generation
Transformers French

T
fdemelo
249
2
Bert Base Cased NER Reranker
MIT
BERT-based Named Entity Recognition (NER) context reranking model for evaluating the helpfulness of contextual sentences for NER predictions
Sequence Labeling
Transformers English

B
compnet-renard
84
0
Dhenu Vision Lora 0.1
Apache-2.0
An agricultural disease detection model fine-tuned based on Qwen-VL-chat, specializing in disease identification and treatment recommendations for three major crops: rice, corn, and wheat.
Text-to-Image
Transformers English

D
KissanAI
96
9
Sage Mt5 Large
MIT
A Russian and English spelling correction model based on the mT5-large architecture, normalizing words to correct spelling and typographical errors.
Large Language Model
Transformers Supports Multiple Languages

S
ai-forever
51
7
Nous Hermes 2 Mistral 7B DPO AWQ
Apache-2.0
Nous Hermes 2 is a next-generation flagship 7B Hermes model based on Mistral 7B DPO, optimized with DPO and demonstrating excellent performance across multiple benchmarks.
Large Language Model
Transformers English

N
solidrust
84
8
Openmath CodeLlama 7b Python Hf
The OpenMath model is specifically designed for solving mathematical problems by integrating textual reasoning with Python interpreter-executed code blocks. Trained on the OpenMathInstruct-1 dataset containing 1.8 million math problem-solution pairs.
Large Language Model
Transformers Supports Multiple Languages

O
nvidia
83
7
7B
A 7B-parameter causal language model compatible with Meta LLaMA 2 architecture, outperforming similar models under 33B in multiple evaluations
Large Language Model
Transformers Supports Multiple Languages

7
CausalLM
177
135
Phi Hermes 1.3B
Other
Phi-1.5 model fine-tuned on the Hermes dataset, primarily used for text generation tasks
Large Language Model
Transformers English

P
teknium
45
44
Esm2 T6 8M UR50D Sequence Classifier V1
MIT
A small sequence classifier trained based on the ESM-2 protein language model, capable of classifying protein sequences into three categories: enzymes, receptor proteins, and structural proteins.
Protein Model
Transformers English

E
AmelieSchreiber
30
0
Airoboros 13b
This is a 13-billion-parameter language model based on the LlaMa architecture, fine-tuned using synthetic data, primarily for research purposes.
Large Language Model
Transformers

A
jondurbin
229
107
Trocr Base Ckb
An OCR system based on Transformer architecture, specifically designed for recognizing Central Kurdish text, trained using synthetic data.
Text Recognition
Transformers

T
razhan
19
0
Trocr Base Printed Synthetic Dataset Ocr
A fine-tuned printed text recognition model based on microsoft/trocr-base-printed, optimized for synthetic OCR datasets
Text Recognition
Transformers English

T
DunnBC22
65
1
Paraphraser Bart Large
Apache-2.0
An automatic paraphrase model based on BART-large architecture, trained on the ParaBank 2 dataset, capable of generating high-quality English sentence paraphrases
Text Generation
Transformers

P
stanford-oval
289
13
T5 Base Multi Sentence Doctor
A T5-based model for correcting sentence errors in English, German, and French texts
Large Language Model
Transformers

T
flexudy
341
45
Featured Recommended AI Models